On the Testing and Estimation of High-Dimensional Covariance Matrices

نویسندگان

  • Chanseok Park
  • Elizabeth Moser
چکیده

Many applications of modern science involve a large number of parameters. In many cases, the number of parameters, p, exceeds the number of observations, N . Classical multivariate statistics are based on the assumption that the number of parameters is fixed and the number of observations is large. Many of the classical techniques perform poorly, or are degenerate, in high-dimensional situations. In this work, we discuss and develop statistical methods for inference of data in which the number of parameters exceeds the number of observations. Specifically we look at the problems of hypothesis testing regarding and the estimation of the covariance matrix. A new test statistic is developed for testing the hypothesis that the covariance matrix is proportional to the identity. Simulations show this newly defined test is asymptotically comparable to those in the literature. Furthermore, it appears to perform better than those in the literature under certain alternative hypotheses. A new set of Stein-type shrinkage estimators are introduced for estimating the covariance matrix in large-dimensions. Simulations show that under the assumption of normality of the data, the new estimators are comparable to those in the literature. Simulations also indicate the new estimators perform better than those in the literature in cases of extreme high-dimensions. A data analysis of DNA microarray data also appears to confirm our results of improved performance in the case of extreme high-dimensionality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimating Structured High-Dimensional Covariance and Precision Matrices: Optimal Rates and Adaptive Estimation

This is an expository paper that reviews recent developments on optimal estimation of structured high-dimensional covariance and precision matrices. Minimax rates of convergence for estimating several classes of structured covariance and precision matrices, including bandable, Toeplitz, and sparse covariance matrices as well as sparse precision matrices, are given under the spectral norm loss. ...

متن کامل

Covariance Matrix Estimation in Time Series

Covariances play a fundamental role in the theory of time series and they are critical quantities that are needed in both spectral and time domain analysis. Estimation of covariance matrices is needed in the construction of confidence regions for unknown parameters, hypothesis testing, principal component analysis, prediction, discriminant analysis among others. In this paper we consider both l...

متن کامل

Structure of Wavelet Covariance Matrices and Bayesian Wavelet Estimation of Autoregressive Moving Average Model with Long Memory Parameter’s

In the process of exploring and recognizing of statistical communities, the analysis of data obtained from these communities is considered essential. One of appropriate methods for data analysis is the structural study of the function fitting by these data. Wavelet transformation is one of the most powerful tool in analysis of these functions and structure of wavelet coefficients are very impor...

متن کامل

Regularized Estimation of High-dimensional Covariance Matrices

Regularized Estimation of High-dimensional Covariance Matrices

متن کامل

Rate Optimal Estimation for High Dimensional Spatial Covariance Matrices

Spatial covariance matrix estimation is of great significance in many applications in climatology, econometrics and many other fields with complex data structures involving spatial dependencies. High dimensionality brings new challenges to this problem, and no theoretical optimal estimator has been proved for the spatial high-dimensional covariance matrix. Over the past decade, the method of re...

متن کامل

Inference for high-dimensional differential correlation matrices

Motivated by differential co-expression analysis in genomics, we consider in this paper estimation and testing of high-dimensional differential correlation matrices. An adaptive thresholding procedure is introduced and theoretical guarantees are given. Minimax rate of convergence is established and the proposed estimator is shown to be adaptively rate-optimal over collections of paired correlat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009